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Abstract 



The first measurement of the cross section for top-quark pair production in pp col- 
lisions at the LHC at center-of-mass energy a/s = 7 TeV has been performed us- 
ing 3.1 ± 0.3 pb^^ of data recorded by the CMS detector. This result utilizes the 
final state with two isolated, highly energetic charged leptons, large missing trans- 
verse energy, and two or more jets. Backgrounds from Drell-Yan and non-W/Z bo- 
son production are estimated from data. Eleven events are observed in the data 
with 2.1 ± 1.0 events expected from background. The measured cross section is 
194 ± 72(stat.) ± 24(syst.) ± 21(lumi.) pb, consistent with next-to-leading order pre- 
dictions. 
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Since its discovery dill, the properties of the top quark have been subject to numerous detailed 
studies [3J, which until recently have only been possible at the Tevatron proton-antiproton col- 
lider. With the advent of the Large Hadron Collider (LHC) era [4J, top-quark processes can be 
studied for the first time in multi-TeV proton-proton collisions. In both pp and pp collisions, 
top quarks are expected to be produced primarily via the strong interaction in top-antitop (tt) 
pairs. At the LHC, the tt production mechanism is expected to be dominated by a gluon fu- 
sion process, whereas at the Tevatron, top-quark pairs are predominantly produced through 
quark-antiquark annihilation. Measurements of top-quark production at the LHC are there- 
fore important new tests of our understanding of the tt production mechanism. This is a crucial 
component of the early LHC physics program, since many signatures of new physics models 
accessible at the LHC either suffer from top-quark production as a significant background or 
contain top quarks themselves. 

In this Letter we present the first measurement of the cross section for tt production in proton- 
proton collisions at the LHC at center-of-mass energy \Js = 7 TeV. The results are based on a data 
sample corresponding to an integrated luminosity of 3.1 ± 0.3 pb^^ [5] recorded by the CMS 
experiment [6J between March and August 2010. This measurement is an important milestone 
for CMS, demonstrating the experiment's capabilities in extracting an intricate signature. 

Within the standard model, the top quark decays via the weak process t — > Wb almost exclu- 
sively. Experimentally, top-quark pair events are categorized according to the decay of the two 
W bosons: the all-hadronic channel, in which both W bosons decay into quarks; the lepton+jets 
channel, in which one W boson decays leptonically the other into quarks; and the dilepton 
channel, in which both W bosons decay into leptons. The measurement described herein is 
performed using the e+e^, and e^fi^ dilepton tt modes. Therefore, the final state stud- 

ied in this analysis contains two oppositely charged leptons (electrons and muons, including 
taus subsequently decaying to electrons and muons), two neutrinos from the W boson decays, 
and at least two jets of particles resulting from the hadronization of the b quarks. 

The central feature of the CMS apparatus is a superconducting solenoid, 13 m in length and 
6 m in diameter, which provides an axial magnetic field of 3.8 T. The bore of the solenoid is 
outfitted with various particle detection systems. Charged particle trajectories are measured 
by the silicon pixel and strip tracker, covering < (p < In in azimuth and \f]\ < 2.5, where 
the pseudorapidity rj is defined as tj = — In [tan 0/2], with 9 being the polar angle of the tra- 
jectory of the particle with respect to the beam direction. A crystal electromagnetic calorimeter 
(ECAL) and a brass /scintillator hadronic calorimeter (HCAL) surround the tracking volume; 
in this analysis the calorimetry provides high-resolution energy and direction measurements 
of electrons and hadronic jets. Muons are measured in gas detectors embedded in the steel 
return yoke outside the solenoid. The detector is nearly hermetic, allowing for energy balance 
measurements in the plane transverse to the beam directions. A two-tier trigger system selects 
the most interesting pp collision events for use in physics analysis. A more detailed description 
of the CMS detector can be found elsewhere t6j. 

The trigger providing the data sample used in this analysis is based on the presence of at least 
one charged lepton, either an electron or a muon, with a minimum transverse momentum 
Pt of 9 (15) GeV/c for the muon (electron). This data sample is used both for the selection 
of the signal and for signal-depleted control regions used for studies related to background 
processes. Simulated signal events that pass the event selection, as described below, satisfy the 
trigger requirements with an efficiency above 97% in the decay mode and above 99% in 

the other two modes, in agreement with estimates from Z boson events in the data. 

Before further consideration, events are required to have at least one good reconstructed pp in- 
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teraction vertex ||7|. Among these events, selection criteria are applied to reconstructed objects 
to identify candidates consistent with dilepton tt processes. 

Muon candidates are reconstructed HI using two algorithms that require consistent hits in the 
tracker and muon systems: one is an algorithm based on the matching of extrapolated trajec- 
tories from the silicon tracker to hits in the muon system (tracker-based muons); the second is 
an algorithm based on performing a global fit of consistent hits in the tracker and the muon 
system (globally-fitted muons). Candidates are required to have pj > 20 GeV/c and \r]\ < 2.5. 
Additionally, the track associated with the muon candidate is required to have a minimum 
number of hits in the silicon tracker, to be consistent with originating from the beam spot, and 
to have a high-quality global fit including a minimum number of hits in the muon detector. 

Electron candidates are reconstructed 191 starting from a cluster of energy deposits in the crys- 
tals of the ECAL, which is then matched to hits in the silicon tracker, used to initiate a special 
track reconstruction algorithm. The electron reconstruction algorithm takes into account the 
possibility of significant energy loss of the electron through bremsstrahlung as it traverses the 
material of the tracker. Electron candidates are required to have pj > 20 GeV/c and pseudo- 
rapidity \tj\ < 2.5. The electron candidate track is required to be consistent with originating 
from the beam spot. Requirements on the values of electron identification variables based 
on shower shape and track-cluster matching are applied to the reconstructed candidates; the 
criteria are optimized in the context of the inclusive W — )■ ev selection and are designed to 
maximize the rejection of electron candidates from QCD multijet production while maintain- 
ing 90% efficiency for electrons from the decay of W/Z bosons. Electron candidates within 
AR = Y^A(/)2 + < 0.1 of a tracker-based or globally-fitted muon are rejected to remove 
fake electron candidates due to muon bremsstrahlung. In addition, electrons consistent with 
anomalous depositions in the ECAL or with photon conversions are rejected. 

Charged leptons from the decay of W bosons are expected to be isolated from other activity 
in the event. For selected muon and electron candidates, a cone of AR < 0.3 is constructed 
around the track direction at the origin and the scalar sum of the track transverse momenta and 
calorimeter energy deposits projected onto a plane transverse to the beam is calculated. The 
contribution from the candidate itself is excluded. If the value of this scalar sum is more than 
15% of the candidate's transverse momentum, the candidate is considered to be non-isolated 
and is rejected. 

Lepton trigger, identification, and isolation efficiencies are measured using inclusive Z events 
from data and are compared with simulation. All comparisons show good agreement, gener- 
ally within 2%. The residual differences between the efficiencies estimated in data and simula- 
tion are treated as systematic uncertainties. 

Events are required to have at least one pair of oppositely charged leptons. Both charged lep- 
tons are required to originate from within 1 cm along the beamline of the reconstructed pp in- 
teraction location. To veto contributions from Z production, the invariant mass of the dilepton 
system, Mpj, is required to be outside a ±15 GeV/c'^ window centered at the mass of the Z boson 
for the e+e^ and }i^H^ modes. Additionally, dilepton candidate events with Mu < 10 GeV/c^ 
are removed, at essentially no penalty for the collected signal. 

The neutrinos from the W boson decays do not interact with the detector and escape without 
depositing any of their energy. The presence of a neutrino manifests itself as an imbalance in 
the measured energy depositions; the imbalance in the projection perpendicular to the beam 
line (missing transverse energy, ^j) is an important distinguishing feature of tt events in this 
channel. At CMS there are several techniques for calculating |10|; here, the raw £7, calcu- 
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lated from calorimeter signals, is made more accurate through a series of correctioris taking into 
account the contribution from the minimally interacting muons and, most importantly, a per- 
track correction for the expected imperfect response of the calorimeter. This track correction 
results in an improved energy resolution, especially for low-energy charged particles. Neither 
the dominant background processes, Drell-Yan Z/7* — > e+e^ and , nor the difficult-to- 
model background from isolated lepton candidates produced in QCD multijet events, contain 
a natural source of large £r. Hence, in the e+e^ and modes, f^j > 30 GeV is required; in 

the e^ji^ mode a looser requirement of f^j > 20 GeV is used due to the significantly smaller 
contribution of Drell-Yan background. 

Dilepton tt events will have at least two hadronic jets from the hadronization of the two b 
quarks. The anti-A;^ clustering algorithm [llj with R = 0.5 is used for jet clustering. Jets are 
reconstructed using calorimeter information and corrected using reconstructed tracks [12J. Fur- 
ther corrections are applied to the raw jet momenta to establish a relative uniform response of 
the calorimeter in jet tj and an absolute uniform response in jet pr- The jet energy scale uncer- 
tainty for these track-corrected jets is 5%. Jet candidates are required to have pj > 30 GeV/c, 
< 2.5, and must not overlap with any electron or muon candidate within AR < 0.4. Events 
with fewer than two jets are discarded. 

The selection efficiency of signal events is evaluated in a simulated tt event sample modeled 
with the MadGraph event generator [13J with up to three additional hadronic jets. The 
events are subsequently processed with PYTHIA [141 to provide showering of generated par- 
ticles, and then processed with a full CMS detector simulation based on GEANT4 1151 . The 
total next-to-leading order (NLO) cross section for top-quark pair production used here to 
scale simulated signal distributions is ct-jj = 157.5^24 4 P^' obtained with MCFM IIT6I [171 . 
Approximate next-to-next-to-leading order (NNLO) calculations for the tt cross section have 
been completed (see for example ||l8jj23il ) but are not used here. The theoretical uncertainty 
on the cross section includes the scale uncertainties, determined by varying the factorization 
and renormalization scales by factors of 2 and 0.5 around the central scale, corresponding to 
the top quark mass (172.5 GeV/c^), and the uncertainties from the parton distribution functions 
(PDFs) and the value of as, following the procedures from the MSTW2008 [24], CTEQ6.6 [|25J . 
and NNPDF2.0 [26J sets. The expected yield of events passing the selection criteria, assuming 
the NLO production cross section, is 1.5 ± 0.3, 1.7 ± 0.3, and 4.5 ± 0.9 for the e+e^, }i^J^^ , and 
e^jx"^ decay modes, respectively. The uncertainties on these predicted event yields combine 
the systematic uncertainties on the event selection, the theoretical production cross section, 
and the integrated luminosity of the sample, where the contribution from the last two sources 
dominates the total. Note that the simulated tf signal sample used for the estimate of the ex- 
pected signal events was generated with the W — >• £v branching fraction set to 1 /9, which is not 
consistent with the standard value (0.1080 ± 0.009) l|27|l used in the cross section measurement. 

The selected sample is not 100% pure in dilepton tt events. There are two types of background 
estimation techniques used in this analysis. One strategy utilizes simulated pp collision events 
to model background processes. There are, however, some pathological backgrounds that are 
harder to model accurately. In such cases, it is preferred to estimate the yields of these events 
from the data. 

Contributions from diboson production (VV, where V = W or Z/ 7*), based on a leading-order 
production cross section of o\rv = 4.8 pb [13J, and electroweak single-top production in the 
tW channel (t/jw = 10.6 pb f2S\) are modeled with the MadGraph event generator and are 
processed in an equivalent fashion as the simulated tt sample used to assess the signal yield. 
The Drell-Yan Z/7* -> tt process {crz/y*^TT = 1666 pb [|29l) is modeled with pythia and 
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Table 1 : The expected number of dilepton tt signal and background events passing the full selection 
criteria, compared to the number of observed events. The procedures for estimating the expected num- 
bers of events and their uncertainties are described in the text. The expected signal yield assumes a tf 
cross section of a^i = 157.5^24 4 pt"- 



Source 


Number of events 


Expected tt 


7.7 ± 1.5 


Dibosons (VV) 


0.13 ± 0.07 


Single top (tW) 


0.25 ± 0.13 


Drell-YanZ/7* t+t~ 


0.18 ± 0.09 


Drell-Yan Z/j* — > e+e^, H^F^ 


1.4 ±0.5 ±0.5 


Events with non-W/ Z leptons 


0.1 ±0.5 ±0.3 


Total backgrounds 


2.1 ± 1.0 


Expected total, including tf 


9.8 ±1.8 


Data 
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MadGraph. The uncertainties on these production cross sections are well within the total 
systematic uncertainty of 50% used for each of these backgrounds. Table[l]gives the simulation- 
based predictions for the event yields from these processes. 

The contributions from two important background sources are estimated from the data: excep- 
tional Drell-Yan events that evade the Z veto and are accompanied by significant missing trans- 
verse energy; and non-W/Z isolated lepton signatures from multijet and W+jets production. 
Difficult-to-simulate instrumental effects influence both topologies and hence it is preferable to 
use calibration samples from the data in these estimations. 

The events rejected by the Z veto are used to estimate the residual contributions from Drell- 
Yan Z/ 7* — > e^e^ and in the surviving selected sample. In the final state the rate 
of events surviving the Z veto is equal to an estimate of the Drell-Yan contribution near the 
Mi( peak, scaled by the expected ratio of off-peak to near-peak events derived from simula- 
tion. The near-peak Drell-Yan Z/ 7* contribution is estimated from the number of all events 
triggering the Z veto, after subtraction of the non-Drell-Yan contribution estimated from e^}V^ 
events passing the same selection and corrected for the differences between the electron and 
muon identification efficiencies. The estimate in the e+e^ mode is done in a similar fashion; the 
summed contribution is shown in Tablejl] The systematic uncertainty of this method, evaluated 
in each mode separately, is estimated to be 50%. This is dominated by detector calibration ef- 
fects and changes of the fraction of Z-vetoed Drell-Yan Z/7* events with increasingly stringent 
requirements (additional jets and missing transverse energy) as estimated from simulation. 

The contributions to the selected sample from isolated lepton candidates from non-W/ Z decays 
are also derived from data. Such lepton candidates mostly arise from jets that are able to satisfy 
the tight lepton identification criteria. A superset of dilepton candidate events is chosen by 
loosening the lepton identification criteria in the trigger samples used for the measurement. 
The number of these candidates passing the loosened selection criteria from non-W/ Z leptons 
can be weighted by the ratio of yields of tight-to-loose lepton candidates (Rtl) to produce 
an estimate of non-W/Z leptons passing the tight selections. The ratio K^l is measured as a 
function of candidate transverse momentum and pseudorapidity in a multijet-dominated data 
sample containing events with one lepton candidate passing loose selection criteria. Additional 
selection criteria are applied to suppress the residual contribution to the loose lepton sample 
from electroweak processes. We assume this Rtl is appropriate for use in the dilepton signal 
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Figure 1: Number of jets in events passing all dilepton selection criteria before the >2-jet requirement 
for all three dilepton modes combined, compared to signal and background predictions. The hatched 
bands reflect the uncertainties on the background predictions. 



sample, and we also consider Rji^ to be independent from the other lepton in events with two 
leptons. Possible differences are assessed from comparisons of Rtl measured in QCD multijet 
data samples with different jet thresholds and flavor content. 

Estimates for the contributions from lepton candidates in pure multijet QCD, with two such 
non-W/Z candidates, and in W+jets, with one such candidate beyond that from the decay of 
the W, are derived separately. A sample of loose dilepton events both failing the tight selections 
is used to estimate the multijet QCD contribution. Loose dilepton events with only one lepton 
failing the tight requirements include contributions from W+jets events, but are contaminated 
by multijets and leptons from W/Z decays. The multijet QCD contamination is subtracted 
using the previous estimate, while the contamination from W/ Z leptons is measured from a 
sample of Z events fulfilling loose selection requirements. 

The prediction for these non-W/Z leptons is shown in Table [T] The systematic uncertainty on 
the non-W/Z lepton estimate is primarily from differences in the jet momentum spectrum and 
flavor composition between the QCD-dominated sample in which Kji^ is measured and the 
sample where it is applied. Other subdominant contributions to the systematic uncertainties 
include the Rtl measurement biases due to electro weak signal contribution, the dissimilarity 
in the trigger between the Rjl calibration sample and the signal sample to which it is applied, 
and from the statistical limitations on the Kji calibration sample. The systematic uncertainty 
on the electron Rji^ is 50%, which corresponds to a 50% (100%) uncertainty on a raw estimate of 
the W+jets (QCD multijets) non-W/Z isolated lepton contribution, prior to accounting for the 
signal contribution to the estimate. Similarly, the systematic uncertainty on the muon Rji^ is 
-100°/°' which corresponds to a ^^qq/o ( l|oo°'^) uncertainty on the estimate of the W+jets (QCD 
multijets) non-W/Z isolated lepton contribution. 

Expected yields from simulated signal and background processes, normalized to estimates 
from data where appropriate, are shown in Fig. [T] as a function of jet multiplicity for events 
satisfying the complete dilepton event selection criteria except the >2-jet requirement; the tt 
signal dominates the bins with at least two jets. 

Eleven dilepton events (3 e+e^, 3 H^fi^, 5 e^f/"^) are observed in the data after applying the 
event selection criteria, with a total of 2.1±1.0 background events expected. We attribute the 
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Figure 2: Distribution of the top-quark mass using two different reconstruction methods fSO^ 'ST'l, com- 
pared with the expected yields from simulated signal-plus-background and background-only hypothe- 
ses. The points in each bin for the two methods are slightly offset in reconstructed mass to allow coinci- 
dent points to be visible. The last bin contains the overflow. 



excess of events above the background expectation to top-quark pair production. 

The top-quark mass reconstruction methods of |30| (KIN, i.e., KINematic, method) and ||3T] 
(MWT, i.e.. Matrix-element Weighting Technique) are applied to the selected events. In both 
methods, numerical solutions to the kinematic equations appropriate for a tt decay with two 
charged leptons in the final state are found for each event. The solutions are based on an ensem- 
ble of values of jet momenta and missing energy, generated corresponding to their expected 
resolution around the measured values. In the KIN method the underconstrained system is 
solved by introducing an additional constraint on the longitudinal momentum of the tt system, 
whose probability distribution is expected to have a negligible dependence on the top-quark 
mass and is therefore assumed from simulation. The top-quark mass value corresponding to 
the largest number of solutions is the reconstructed mass for each event. In the MWT method 
the system is solved for a range of top-quark mass values, and weights are assigned based on 
the likelihood of each solution. The solution with the largest weight is used as the mass estima- 
tor. Figure |2] shows that the kinematics of the selected events are statistically compatible with 
predictions based on a top-quark mass of 172.5 GeV/c^, demonstrating the consistency of the 
selected sample with top-quark pair production. 

Further, beyond the complete event selection described above, the property that the two jets 
expected in dilepton tt events both originate from b quarks is exploited to further confirm the 
top-quark signal. A b-quark jet identification algorithm that relies on the presence of charged 
particle tracks displaced from the primary pp interaction location, as expected from the decay 
products of long-lived b hadrons [32 1, is used. A jet is identified to be from a b quark if there 
are at least two tracks satisfying a minimum impact parameter significance requirement. The 
efficiency of this algorithm for a b-quark jet in dilepton tt signal events is about 80% with a 
10% false positive rate, as estimated in simulated QCD multijet events with no b quarks. This 
algorithm is applied to events passing all the selection criteria. The multiplicity of jets satis- 
fying these b-tagging criteria in events passing the full dilepton event selection is shown in 
Fig. |3] Although not used directly in the cross section extraction, the b-tag multiplicity pro- 
vides additional support for the hypothesis that the selected data are consistent with dilepton 
tt production. 
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Figure 3: Number of b-tagged jets in events passing all dilepton selection criteria for all three dilep- 
ton modes combined, compared to signal and background predictions. The hatched bands reflect the 
expected uncertainties on the b-tag efficiency for signal events. 



The top-quark pair production cross section is determined from the ratio of the number of 
observed events in the data after background subtraction with the product of the signal ac- 
ceptance, selection efficiency, the branching fractions, and the integrated luminosity. From the 
simulated ft sample, the acceptance times efficiency is found to be (23.0 ± 1.4)% for events con- 
tributing to the e^e^, }i^H^ , and e^^^ modes combined. The total branching fraction for tf 
to the three modes of our final state is (6.45 ±0.11)% p7|. The systematic uncertainty on the 
acceptance times efficiency is described below. 

Various sources of systematic uncertainty related to the event selection have been evaluated. 
The systematic uncertainty assigned to the dilepton selection efficiency is 4.4%, obtained from 
a comparison of Z events in data and simulation, together with half of the difference between 
the efficiencies obtained in simulated Z and tf events. The effect of multiple pp interactions in a 
single beam crossing — an effect that is present in the data but not in these simulated samples 
— is included in this uncertainty. The systematic uncertainty due to the reconstruction of jets 
and missing transverse energy is estimated by varying the jet energy scale by ±5%, simulta- 
neously with a ±5% variation in the hadronic part of the missing transverse energy, resulting 
in a value of 3.7%. Uncertainties on the simulation of the signal selection include the amount 
of QCD radiation, hadron and tau decay modeling, and the W leptonic branching fraction; 
these sources combined give a systematic uncertainty of 2.8%. Other sources of systematic un- 
certainty pertaining to the signal, including uncertainties in the parton distribution functions 
inside the colliding protons and the effect of additional minimum bias interactions in the signal 
selection, are neglected because they were found to have a relatively small impact. The overall 
systematic uncertainty on the total ft cross section from the above sources is 6.4%. 

The background contributions from single-top, diboson, and Drell-Yan Z/7* — > t+t^ pro- 
cesses shown in Table [l] are obtained from simulation and found to be small compared to the 
total event yield. Each of these backgrounds is assigned a 50% systematic uncertainty. The 
contributions from Drell-Yan e+e^ and processes and events with non-W/Z isolated 

leptons are estimated from data with absolute systematic uncertainties of 0.5 and 0.3 events, 
respectively. The contribution to the systematic uncertainty on the cross section from the un- 
certainties on the background estimates is 11%. 
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The total systematic uncertainty on the measured cross section, dominated by the uncertainty 
on the estimated background yield, is 24 pb. An additional systematic effect of 21 pb, due to a 
11% relative uncertainty on the integrated luminosity measurement [5 J, is quoted separately. 

Taking into account the data yield, the background estimation, the branching fraction, the sig- 
nal acceptance and efficiency, the integrated luminosity, and all associated statistical and sys- 
tematic uncertainties, the top-quark pair production cross section is measured to be 

(7(pp tf) = 194 ± 72(stat.) ± 24(syst.) ± 21(lumi.) pb. 

An alternative analysis, exploiting jets constructed only from silicon tracker information [33j 
and without missing transverse energy requirements in the event selection, yields a similar 
cross section. The quoted measurement can be compared with the calculated NLO theoretical 
cross section of 157.5^244 pb for a top-quark mass of 172.5 GeV/c^ ||T6l[T7| . 

In conclusion, the first measurement at the LHC of the cross section for tt production has been 
completed. This measurement, made with an integrated luminosity of 3.1 ± 0.3 pb^^, is only 
the beginning of a rich top-quark physics program to be conducted at the CMS experiment. 
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